Data driven estimates for mixtures
نویسندگان
چکیده
Data with asymmetric heavy tails can arise from mixture of data from multiple populations or processes. We propose a computer intensive procedure to fit by quasi-maximum likelihood a mixture model to a robustly standardized data set. The robust standardization of the data set results in well defined tails which are modeled using extreme value theory. The data are assumed to be a mixture of a normal distribution contaminated by a distribution with heavy tails. This procedure provides an analytical expression for the mixture distribution of the data, which may be used in simulations and construction of scenarios, while providing an accurate estimation of quantiles associated with probabilities close to zero or one. The performance of the proposed data driven procedure is assessed by simulation experiments and also by its application to real data.
منابع مشابه
A Model-Driven Decision Support System for Software Cost Estimation (Case Study: Projects in NASA60 Dataset)
Estimating the costs of software development is one of the most important activities in software project management. Inaccuracies in such estimates may cause irreparable loss. A low estimate of the cost of projects will result in failure on delivery on time and indicates the inefficiency of the software development team. On the other hand, high estimates of resources and costs for a project wil...
متن کاملAxial and Torsional Free Vibrations of Elastic Nano-Beams by Stress-Driven Two-Phase Elasticity
Size-dependent longitudinal and torsional vibrations of nano-beams are examined by two-phase mixture integral elasticity. A new and efficient elastodynamic model is conceived by convexly combining the local phase with strain- and stress-driven purely nonlocal phases. The proposed stress-driven nonlocal integral mixture leads to well-posed structural problems for any value of the scale parameter...
متن کاملApplication of Adaptive Mixtures and Fractal Dimension Analysis Technique to Particle Physics
The discrimination of physics “signal” from “background” is one of the most important subjects in high energy physics analysis since this process usually governs the magnitude of measurement errors. Background suppression using kernel density estimation to estimate the parent distribution of a data sample appears to be an effective method. In this paper, Adaptive Mixtures [1] and Kernel Density...
متن کاملA Data-driven Method for Crowd Simulation using a Holonification Model
In this paper, we present a data-driven method for crowd simulation with holonification model. With this extra module, the accuracy of simulation will increase and it generates more realistic behaviors of agents. First, we show how to use the concept of holon in crowd simulation and how effective it is. For this reason, we use simple rules for holonification. Using real-world data, we model the...
متن کاملData and Methods for the Production of National Population Estimates: An Overview and Analysis of Available Metadata
Thomas Spoorenberg Translated by: Elham Fathi Statistical Center of Iran Abstract. Official population estimates can be produced using a variety of data sources and methods. These range from the direct extraction of information from continuously updated population registers to procedures for updating the status of a population enumerated previously in a periodic census. Additional sources and ...
متن کاملنمونهگیری پاسخگو محور در مقایسه با سایر روشهای نمونهگیری از جوامع پنهان
Sampling hidden populations is challenging due to the lack of convenience statistical frames. Since most populations exposed to special diseases are hidden and hard to reach, sampling methods that produce representative and efficient samples from the populations have become a study subject for researches all over the world. Because of the unknown probability of selecting samples in conventional...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Computational Statistics & Data Analysis
دوره 47 شماره
صفحات -
تاریخ انتشار 2004